Migemo: Incremental Search Method for Languages with Many Character Faces

Authors

Satoru Takabayashi

Hiroyuki Komatsu

Toshiyuki Masui

Abstract

We introduce a new incremental search method called Migemo for languages with many character faces. Migemo performs the incremental search by dynamically expanding the input pattern into a compact regular expression which represents all the possible words that match the input pattern. We show that Migemo is useful not only for searching texts in Japanese and other East Asian languages, but also for performing sophisticated searches on ASCII-only documents.
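
The expansion step described in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the toy dictionary and the names `TOY_DICTIONARY`, `expand_pattern`, and `incremental_search` are hypothetical, and the sketch builds a plain alternation rather than the compact regular expression the paper describes.

```python
import re

# Hypothetical toy dictionary (an assumption for illustration only):
# romanized reading -> written forms that a user might want to find.
TOY_DICTIONARY = {
    "kensaku": ["検索", "けんさく"],
    "kenkyuu": ["研究", "けんきゅう"],
    "nihongo": ["日本語", "にほんご"],
}


def expand_pattern(prefix):
    """Build a regex matching the typed prefix itself or any written
    form whose romanized reading starts with that prefix."""
    candidates = {prefix}
    for reading, words in TOY_DICTIONARY.items():
        if reading.startswith(prefix):
            candidates.update(words)
    # Put longer alternatives first and escape regex metacharacters.
    ordered = sorted(candidates, key=lambda w: (-len(w), w))
    return re.compile("|".join(re.escape(w) for w in ordered))


def incremental_search(text, prefix):
    """Return the first substring of text matched by the expanded
    pattern, or None when nothing matches."""
    match = expand_pattern(prefix).search(text)
    return match.group(0) if match else None


if __name__ == "__main__":
    text = "全文検索の研究"
    # Re-expand and re-search after each keystroke, as an incremental search would.
    for typed in ("k", "ken", "kenk"):
        print(typed, incremental_search(text, typed))
```

Because the pattern is rebuilt on every keystroke, the set of candidate words narrows as the user types more of the reading, while the literal prefix stays in the alternation so ASCII text is still found.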

Similar resources

Doctor's Thesis: Synthetic Assistance for Creation and Communication of Information

As the Internet has become popular, the circulation of information has increased rapidly, and many researchers have actively studied how to create and share information effectively. Creation and sharing of information can be seen as a continuous process of 1) how to find necessary information, 2) how to arrange the information, and 3) how to share the information with others. We consider that search and co...

SriShell Primo: A Predictive Sinhala Text Input System

Sinhala, spoken in Sri Lanka as an official language, is one of the less privileged languages; still there are no established text input methods. As with many of the Asian languages, Sinhala also has a large set of characters, forcing us to develop an input method that involves a conversion process from a key sequence to a character/word. This paper proposes a novel word-based predictive text i...

A Hybrid Meta-Heuristic Method to Optimize Bi-Objective Single Period Newsboy Problem with Fuzzy Cost and Incremental Discount

In this paper the real-world occurrence of the multiple-product multiple-constraint single period newsboy problem with two objectives, in which there are incremental discounts on the purchasing prices, is investigated. The constraints are the warehouse capacity and the batch forms of the order placements. The first objective of this problem is to find the order quantities such that the expected ...

Substring-based unsupervised transliteration with phonetic and contextual knowledge

We propose an unsupervised approach for substring-based transliteration which incorporates two new sources of knowledge in the learning process: (i) context by learning substring mappings, as opposed to single character mappings, and (ii) phonetic features which capture cross-lingual character similarity via prior distributions. Our approach is a two-stage iterative, boot-strapping solution, wh...

Arabic Hand Written Character Recognition Using Modified Multi-Neural Network

Handwritten character recognition is an interesting area for researchers in current artificial intelligence and advanced computing. The complexity of the language controls the ability and the challenge of recognizing its characters, whereas this complexity and uncertainty become multiplied. The use of Latin languages like English, or Spanish, limits the uncertainty because of the limited structure of the...

Publication date: 2001
